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Note on a Family of Monotone Quantum Relative Entropies 
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Abstract 

Given a convex function ip and two hermitian matrices A and B , Lewin and Sabin 
study in [1] the relative entropy defined by "H(A,R) = TV [<p{A) — <p(B) — — B)}. 

Amongst other things, they prove that the so-defined quantity is monotone if and only if 
ip' is operator monotone. The monotonicity is then used to properly define T-L(A,B ) for 
bounded self-adjoint operators acting on an infinite-dimensional Hilbert space by a limit¬ 
ing procedure. More precisely, for an increasing sequence of finite-dimensional projections 
{Pnj^Li with P n —> 1 strongly, the limit \\m. n ^ f00 / H(P n AP ril P n BP n ) is shown to exist 

and to be independent of the sequence of projections {P n }^Li- The question whether 
this sequence converges to its ’’obvious” limit, namely TV [p{A) — ip{B) — ip'{B){A — B)\, 
has been left open. We answer this question in principle affirmatively and show that 
lim n _>oo 'H(P n AP n , P n BP n ) = Tr [ip(A) - ip(B) - -^ip(aA + (1 - a)B) | a=0 ]. If the opera¬ 
tors A and B are regular enough, that is (A — B ), tp(A) — <p{B) and ip'(B)(A — B) are trace-class, 
the identity Tr \<p(A) - <p(B) - ^<p (aA + (1 - a)B) | Q=0 ] = Tr [ip(A) - ip(B) - ip\B)(A - R)] 
holds. 
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I. INTRODUCTION AND MAIN RESULTS 


We start with a quick review of the setting and the results of [fj that are of interest for us. Let 
ip £ C° ([0,1],R) be a continuous, convex function such that <p' is continously differentiable 
on (0,1) and let A and B be two hcrmitian matrices with 0 < A, B < 1. Lewin and Sabin 
define a family of relative entropies of A with respect to B by the formula 

H(A, B) = Tr [y(A) - V (B) - V '(B)(A — B)]. (1) 

As long as 0 < B < 1, the above expression is well defined. If 0 and/or 1 are contained in 
the spectrum of B and if (p is not differentiable at these points this is still true if A = B 
on Ker(H), Ker(l — B) or Ker(H) © Ker(l — B ), respectively (the trace is taken on the 
complement of these subspaces). Are the just mentioned conditions not fulfilled, they define 
H{A,B) = oo. 

In [1, Theorem 1], the authors show that the so-defined relative entropy is monotone if and 
only if tp' is operator monotone. We quote: 

Theorem 1. (Monotonicity). Under the above conditions, the following are equivalent 

1. p>' is operator monotone on (0,1); 

2. For any linear map X : h± —> I 12 on finite-dimensional spaces hi and h 2 with X*X < 1, 
and for any 0 < A, B < 1 on hi, we have 

U{XAX *, XBX*) < FL{A, B ), (2) 


with FL(A, B) defined in Eq. (JT() . 

In a second step, this result is used to extend the definition of the relative entropy to self- 
adjoint operators acting on an infinite-dimensional separable Hilbert space h via the formula 

H(A,B) := lim H(P n AP n ,P n BP n ), (3) 

n—>00 

where {P n }n Li i s an increasing sequence of finite-dimensional projections with P n —» 1 in 
the strong operator topology. By C(h) we denote the set of bounded linear operators on h 
and h\, ho denote infinite-dimensional separable Hilbert spaces. We quote again: 
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Theorem 2. (Generalized relative entropy in infinite dimension). We assume that p e 
C° ([0,1],R) and that ip' is operator monotone on (0,1). 

1. (H is well defined). For an increasing sequence P n of finite-dimensional projections 
on h such that P n —> 1 strongly, the sequence l-l(P n AP n , P n BP n ) is monotone and 
possesses a limit in R + U {+oo}. This limit does not depend on the chosen sequence 
P n and hence FL(A, B ) is well-defined in R + U {+oo}. 

2. (Approximation). If X n : hi —$■ h 2 is a sequence such that X*X n < 1 and X*X n —* 1 
strongly in hi, then 

U{A,B)= lim H{X n AX n ,X n BX n ). (4) 

n—>oo 

3. (Weak lower semi-continuity). The relative entropy is weakly lower semi-continuous: 
if 0 < A n , B n < 1 are two sequences such that A n A and B n B weakly-* in C{h), 
then 

H(A,B) < lim inf H(A n ,B n ). (5) 

n—>oo 


As one would expect, FL{A,B) can take finite values when A and B themselves are not 
compact, as the following upper bound shows [1. Theorem 3]: 

H(A,B)<CTr(J 1 + j r f^Aj(A-Bf. ( 6 ) 

Note that the only dependence on ip on the right hand side of Eq. (JSJ) is in the constant 
C. The question whether their notion of relative entropy in infinite dimensions is related to 
Tr (p(A) — <p{B) — tp'(B){A — B)\, which is a-priori well-defined when the operator under 
the trace is trace-class, has been left open by the authors. 


We answer this question in principle affirmatively, where ”in principle” stands for the fact 
that Tr [c p(A ) — <p(B) — (p'{B){A — B)\ turns out not to be the correct limit, in general. 


Theorem 3. Let (p G (7 0 ([0,1],R) be such that ip' is operator monotone on (0,1) and let 
{P n }(?=\ be defined as in Theorem d. Then 


lim H(P n AP n , P n BP n ) = Tr 


p)(A) - p>(B) - ~^-ip (aA + (1 - a)B) 
da 


a=0 


( 7 ) 


with the understanding that either both sides are finite and equal each other, or both sides 
are infinite. 
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Remark 1. We define the differential in Eq. © by the formula 
■£('f,ip(aA+(1 - a)B)'if)\ a=0 = lim Q _> 0 a -1 (if, [ip{aA + (1 - a)B) - p(B)} if). In 
case p' is continuous on [0,1], this limit exists for all Eh. If p' is not continuous on the 
whole interval, it has singularities at 0 and/or 1 (we remind that p' is monotone increasing 
and continuous on (0, 1) by assumption) and we have to distinguish between three cases. 
First, assume B has no eigenvalues at the points of discontinuity of p'. Then the above 
limit exists for all in a suitably chosen dense set D C h (see Section 1771 Lemma 0 for 
more details). Second, if B has an eigenvalue at a point of discontinuity of p', if is the 
corresponding eigenvector to the just mentioned eigenvalue and (A — B)if / 0, then the 
above limit equals —oo. Third, if (A — B)if = 0 in the just mentioned situation, the above 
limit is equal to zero. 

Remark 2. The operator monotonicity of the function p' implies the operator convexity of 
its primitives which in turn implies that p{A) —p{B ) — f^p (aA + (1 — a)B) | a=0 is positive 
(see Section [ 77 | for more details). This property can now be used to define a notion of 
trace that is applicable on the right hand side of Eq. (jTj) . Assume for the moment that B 
has no eigenvalues at points of discontinuity of p' with (A — B) / 0 on the corresponding 
eigenspaces. Then the symmetric operator p(A) — p(B) — f^p (aA + (1 — a)B) | a=0 can be 
defined on the dense set D mentioned in Remark 0 and since it is positive it has a Friedrichs 
extension (T,T>(T)). By restricting attention to bases {ep}°^ =1 where all eg lie in the form 
domain of T, we can define the trace of the operator on the right hand side of Eq. (JTj) to be 
( e / 3 ,Teg), see Section I7H for more details. The so-defined trace equals the usual trace 
whenever p(A) — p(B) — f^p (aA + (1 — a)B) | Q=0 is trace class and +oo otherwise. Now if 
B has an eigenvalue at a point of discontinuity of p' and {A — B) / 0 on the corresponding 
eigenspace Remark^ suggest to define the trace on the right hand side of Eq. (0) to be +oo. 
This goes hand in hand with the definition of the relative entropy for hermitian matrices of 
Lewin and Sabin which has been explained in the beginning of the introduction. 


Remark 3. The idea to define the relative entropy as a trace over a manifestly positive oper¬ 
ator in order to make it well-defined on a larger set, has already been used in J \2f. The formula 
in the just mentioned reference equals the trace over p(A) — p(B) — -Ap (aA + (1 — a)B) | Q=0 
with p{x) = xln(x) + (1 — x) ln(l — x ) and resembles Eq. (J8]). 


Remark 4. Assuming matrices, the equality Tr [p(A) — p(B) — p'(B)(A — B)\ = 
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Tr [if (A) - p(B) - £p (aA + (1 — a)B) | Q=0 ] holds as one can see with a direct compu¬ 
tation that exploits the cyclicity of the trace (see Theorem V.3.3] for a simple way to 

compute the derivative). In general however, this cannot be expected. 


A crucial ingredient of our proof of Theorem [3] is the following Lemma which we state here 
because it is of interest in itself. 


Lemma 1. Let p G C([0,1],M) be such that p' is operator monotone on (0,1)- Assume 
further that B has no eigenvalues at points of discontinuity of p' with {A — B) ^ 0 on the 
corresponding eigenspaces. Then there exists a constant b > 0 and a unique Borel probability 
measure fi on [— 1 , 1] such that 


Tr 

2b 


p(A) - p(B) —— p(aA + (1 - ol)B) 
da 


*1 r oo 


Tr 


'-1 Jo 


:Q 


a=0 

l 


( 8 ) 


Q- 


1 + A(1 - 2B) +1 1 + A(1 - 2A) +1 1 + A(1 - 2 B) + t 


dt dp( A), 


where Q = (A — B). (To be precise, /i is unique only ifb>0.) 


Remark 5. The formula on the right hand side of Eq. (JED is in many circumstances easier 
to handle than the formula on the left hand side. This is because it is the integral (with a 
positive measure) of a positive function which is the trace of a bounded positive operator. In 
particular, the operator under the trace has a simpler form than the one on the left hand 
side of Eq. (JED • 


Assuming more regular operators A and B, the equality mentioned in Remark |4] is still true 
in infinite dimensions as the following statement shows. 


Theorem 4. Let p G C'°([0,1], M.) be such that p' is operator monotone on (0,1). As¬ 
sume in addition that [A — B), p(A) — p(B) and p'(B)(A — B ) are trace-class. Then 
-f^p {aA + (1 — a)B) | a=0 is trace-class and the identity 


Tr 


p(A) - p(B) 


d_ 

da 


p (aA + (1 


c)B) 


Tr l v (A)- v (B)- v '(B)(A-B)] (9) 


holds. 


Remark 6. In mathematical physics one encounters applications where the state of a phys¬ 
ical system is defined to be a minimizer of a nonlinear functional in which the physical 
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relative entropy appears, see e.g. |j, y/. For a fermionic many-particle system the function 
tp(x) = xln(x) + (1 — x) ln(l — x) is the right choice to define the physical relative entropy, 
while for bosons it is tp(x) = a;ln(x) — (1 + x) ln(l + x ). We note that both functions fulfill 
the requirements of Theorems Since the right hand side of Eq. (JUJ) is in practice much 
easier to evaluate explicitly than the left hand side when given a trial state A, Theorem d 
becomes important if one wants to derive an upper bound for the minimal energy of such a 
functional. The left hand side of Eq. o in contrast is important since it allows one to prove 
upper or lower bounds for the relative entropy with the help of Klein’s inequality, see Jj/. 


II. PROOF OF THEOREM H 


The main ingredient of our proof is the derivation of the formula stated in Lemma d Having 
this identity at hand, we show the convergence of the relative entropy by first showing it for 
the trace under the integral. In a second step, we argue why the limit can be interchanged 
with the integrals over A and t. At this point Theorem [1] enters the analysis in a crucial way. 


Since cp' is operator monotone on (0,1) there exists a unique Borel probability measure p 
on [—1,1] such that (see 4, Corollary V.4.5]) 

2.x- - 1 


<p'(x) = a + b 


-d/i(A), 


( 10 ) 


Li 1 - A(2x - 1) 

with b > 0 (To be precise, p is unique only if b > 0.). When integrating the above expression, 
one obtains a primitive for ip' which is of the form 


(p(x) — ax + c — 


i-i 


2x — 1 In (1 + A(1 — 2x)) 


A 


A 2 


dp(\). 


( 11 ) 


Since 14 - ln(x) is an operator convex function, the same holds true for <p. 


To keep the main argumentation straight, we first prove two technical Lemmata. The first 
concerns the relation between the regularity of ip at the endpoints of the interval [0,1] and 
the behavior of the measure p in the vicinity of —1 and 1. 

Lemma 2. Assume p G (7 o ([0,1],R) such that ip' is operator monotone on (0,1). Then 

M{-i}) = o = /i({i}), 

,1 /- 1/2 

/ — ln(l — X)dp(X) < oo and / — ln(l + X)dp(X) < oo. (12) 

J 1/2 J- 1 
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(13) 


If in addition p' G C°([0,1],M), the stronger implications 


'1/2 


1-A 


d/i{ A) < oo and 


,- 1/2 


'-1 


1 + A 


dp( A) < oo 


hold. In case p' is not continuous at 1 the first integral in Eq. (fT3|) equals +oo and if it is 
not continuous at 0 this is true for the second integral. 


Proof. We start with the first case, hence we assume that only p is continuous on [0,1]. Since 
the limits lim^o p(x) and lim^i p(x) exist we can conclude that /i({—1}) = 0 = /x({l}) 
holds. We further conclude that the following limit exists (see Eq. OB) 


oo > lim 


- 1 ln(l + A(1 — 2x)) 


X —>1 


1/2 


A 2 


d/i(A) > - 


'1/2 


ln(l — A)d/i(A). 


(14) 


To come to the expression on the right hand side, we have applied Fatou’s Lemma. Doing the 

1 /9 

same argumentation again, this time with the limit x —> 0, yields J_ x ~ ln(l + A)d/r(A) < oo. 
If also <p' is continuous on [0,1], we compute 

1 


2x — 1 

limyw = a + 6lim x _ X{2x _ y MV > « + 


-d/i(A), 


(15) 


l-i 1 - A 

where, as before, we have applied Fatou’s Lemma. The same procedure with lim x ^o ~ ( p'( x ) 
yields the other bound. Using the monotonicity of the integrand, one easily shows that 
the just discussed integrals diverge to +oo in case p' is not continuous at 0 and/or 1, 
respectively. □ 


In order to obtain a handy formula for the operator f^tp {aA + (1 — a)B) | Q=0 , we explicitly 
compute the directional derivative. 

Lemma 3. Assume p G C°([0,1],M) such that <p' is operator monotone on (0,1). If p' is 
continuous on [0,1] or if it is discontinuous at 0 and/or 1 and B has no eigenvalue at these 
points then 


7 7 p 1 

— (V>, p(aA + (1 - a)B)if) = a {fj, (A - B)if) - - 
da o=o 2 J_ x 




A 


2 f°° 


ip. 


:(A - B)- 


if ) dt 


(16) 
dp{ A), 


, 0 v l + A(l-2 B)+V 1 + A(1 — 2B) + t 

where a,b and p are defined by Eq. (fTOlh In case of the first scenario (p' continuous on 
[0,1]/, the derivative is taken for all if G h while in the second scenario it is taken only for 
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all if in a dense set D C h. Explicitly, the set D is given by D = U e >ol(e < B < 1 — e)h in 
case 0 and 1 are points of discontinuity of p' and by the obvious generalization when p' is 
discontinuous only at one of these points. This accounts for the fact that the limiting operator 
may be unbounded. In case p f has discontinuities and B has eigenvalues at at least one of 
these points, we have to treat the above limit with if being one of the eigenvectors to the just 
mentioned eigenvalues separately. We distinguish between two cases. If ( A — B)if ^ 0 we 
have 

if^J = oo. 

If ( A — B)if = 0 instead, the limit in Eg. (Q~Tj) equals zero. 



lim ( if, 

a —>0 ' 


p(aA + (1 — aB)) — p(B ) 


a 


Proof. Using Eq. m, one can easily check the identity 


J- (4>, V {otA + (1 - a)B) if) =a (if, (A - B)if) - ^ lim 
da a=o 2 a->o 


-i 


^, 2(j4 a % ) (18) 


, , In (1 + A(1 — 2(S + o(n — S)))) — In (1 + A(1 — 2B)) , 
+ |^,-^-* 


d/i(A). 


The second term is just the difference quotient defining the directional derivative of the 
second term in Eq. CD- Let us have a closer look at the term with the logarithms. We use 
the formula In (a:) = J 0 °° dt and apply the resolvent identity once, to see that it 

can be written as 


In (1 + A(1 - 2 (B + g(A - B)))) - In (1 + A(1 - 2 B)) _ 


2 

A 


ip, 


aA 2 

1 


■(A — B)- 


(19) 
-if ) dt. 


1 + A(1 - 2 B) + t y ' 1 + A(1 - 2 (B + a (A - B ))) + t 

In order to explicitly compute the limit a —* 0, it needs to be interchanged in a first step 
with the integral over A and in a second step with the integral over t. The second step 
will follow easily from the estimates used to show the first step since A G (—1,1) is then 
fixed which implies that all resolvents are uniformly bounded. We therefore focus on the 
interchange of the limit a —» 0 with the integral over A. In order to be able to apply 
dominated convergence, we have to find a positive function g G L 1 (/i) with 




2 

A 


(if, R(B)(A - B)R(B + a(A - B))if) dt 


< 0(A) 


( 20 ) 
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for all 0 at least in a dense subset of h (The case where B has eigenvalues at points of 
discontinuity of p' will be treated at the end.). To shorten the writing, we have introduced 
the notation R{B ) = (1 + A(1 — 2B) + f) -1 . 

Let us first investigate the behavior of our integrand for A G (—1 + e, 1 —e). We write 
R(B) = — Y^y(l — 2 B)R(B) (and the same for R(B + a(A — B ))) and evaluate the 

contribution of the first term which reads 

~jf YTt ^ TTt dt = ^ _ ■ ( 21 ) 

It cancels the first term under the integral on the right hand side of Eq. (fT8]h The three 
remaining terms have no singularity and can be bounded by a constant. 


In the vicinity of A = — 1 and A = 1 the situation is a little different and one needs to argue 
more carefully. We will distinguish three cases depending on the regularity of p' at 0 and 1 
and on the spectrum of B. First let us assume that p' is not continuous at 0 and 1 and that 
B has no eigenvalues at these points. Let D e = 1 (e < B < 1 — e) h and define D = U e> o D e . 
Due to our assumptions on B, the set D is dense in h. For 0 G D, we investigate 




< 


1 +A(1 -2 B)+t 
1 


{A-B) 




1 + A(1 — 2(aA + (1 — a)B )) + t 

P-B|' 1 




d t 




( 22 ) 


dt 


l + A(l-2 B) + t r " 1100 1 + A(1 — 2(aA + (1 — o/)B)) + t 

which is the relevant contribution from Eq. (1201) . The part of the integral over t from say 1 
to oo is easy to control. One just bounds the resolvents in operator norm by 1 jt. After the 
evaluation of the integral, we end up with a constant. To bound the other part of the integral 


over t (the one from 0 to 1), we use the fact that -0 G D which implies that 
1/e for an e > 0 that depends on 0. On the other hand 1 

for A close to —1. Putting this together, we obtain 


i 


l+A(l-2 B)+t 


0 


< 


l + A(l-2 B)+t 


0 


A-B 


l+A(l-2(oA+(l-a)B))+t 


0 


0 


1 


1+A+f 


1 + A(1 - 2(aA + (1 - a)B)) + t 


df 


(23) 


<U-B | 


1 + A +1 


d t + C 


< C(e) (— ln(l + A) + 1). 


A similar bound can be obtained for A close to 1. There the function — ln(l — A) enters the 
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analysis. Hence, there exists a constant C(e) depending on ip such that 



3^,,) 



ip, R{B){A - B)R{B+a{A - B))ip jdt 


\ 


(24) 


<C( £ )(-ln(l-|A|) + l). 


Because of Lemma [21 the bound allows us to take the limit inside the integral and proves 
the claim in this situation. 


Nearly the same argumentation goes through when B has spectrum at 0 and/or 1 and if ip' is 
continuous at these points. By bounding both resolvents like we did with the second in the 
previous step, that is ||i?(H)-0|| < (1 — |A| + f) _1 and the same with || R{aA + (1 — a)B)ip\\, 
one obtains 



2 (A - B) 

A 



2 

A 


ip, R{B){A - B)R{B + a(A - B))ip d t 


< 


C 


1-|A| 


(25) 


Again due to Lemma [2], this is enough to interchange the limit and the integral. The case 
where ip f is discontinuous only at one point is treated in the obvious way. 


For the last case we have to assume that tp' is not continuous at 0 and/or 1 and that B has 
an eigenvalue at at least one of these points. We only investigate the relevant contribution. 
Let ip be the eigenvector of B to the eigenvalue 0 for example (the other cases go the same 
way). We will show that 


lim 

a —AO 


-1/2 _ x 


-1 


A Jo 


{ip, R{B){A - B)R(B + a(A - B))ip) d t d/i(A) = oo, 


(26) 


if ( A — B)ip ^ 0 and that the above limit equals zero in case ( A — B)ip = 0. Using 
R(B + <y(A — B)) = R(B) +2a\R{B + a{A — B)){A — B)R(B), the integrand can be written 


as 


{ip, R{B)(A - B)R{B)ip) - 2a {ip, R{B){A - B)R{B + a{A - B)){A - B)R{B)ip) 


1 + A +1 


-1 

T 


{ip, Aip) — 2 a {ip, AR{B + a{A — B))Aip) 


(27) 


Let us first assume that {A — B)ip ^ 0 which implies that {ip, Aip) > 0. Since the function 
t i— y j is operator convex on the interval (0, oo), see 4, Exercise V.2.11], we know that 
R{aA + (1 — a)B) < aR{A) + (1 — a)R{B). If we apply this inequality on the right hand 
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side of Eq. (|2TJ) and discard all positive terms in order to obtain a lower bound, we find 

2 r i 


1 + A + t 
> 


X 

1 


(ip, Aip) — 2a (ip, AR(B + a(A — B))Aip) 


( 28 ) 


2 r 


—— (ip, Aip) — 2a 2 (ip, AR(A)Aip) — 2a (ip, AR(B)Aip) 

A 


1 + A +1 j 

The right hand side of this equation, viewed as a function of ck, is certainly monotone and 
so we can use monotone convergence to show that 


/ — 1/2 noo 

-i Jo V i + x + t 


2 r 


-1 

T 


(ip, Aip) — 2a 2 (ip, AR(A)Aip) 

— 2a (ip, AR(B)Aip) 

1 


(29) 


d t d/i(A) 


f-i/2 /• 

= (V>, Aip) j — 


1 + A +1 


j -i 

r 1/2 i 

> (ip, Aip) J YTx dfi(yX ^ = °°' 


d t dfx(X) 


The last equality is achieved with the help of Lemma [2] Now assume that (A—B)ip = 0 which 
means that Aip = 0. Hence, [aA + (1 — a)B] ip = 0 and ip(aA + (1 — a)B)ip = <^(0)^. Since 
this expression is a constant the derivative with respect to a vanishes. A similar argument 
can be done when B has 1 as an eigenvalue. This concludes the proof of Lemma [3j □ 


Before we come to the main part of the proof, we have to argue how the trace on the right 
hand side of Eq. (JTJ) can be defined. Let us for the moment assume that B has no eigenvalues 
at points of discontinuity of ip' with (A — B) ^ 0 on the corresponding eigenspaces. Then 
by Lemma [3J, we can define the quadratic form 

ip(B + a(A-B))-ip(B)' 


q(ip, ip) = lim ( ip, 


a —>0 


ip (A) - <f(B) 


a 


rj 


(30) 


on the dense set D C h (The set D has been defined in Lemma [31). The operator convexity 
of if implies that A-B+a(A-B))-y(B) ^ if (A) — if(B) holds for all 0 < a < 1. Since the 
inequality is preserved by the limiting procedure a —» 0 we conclude that q is positive, ft is 
an easy exercise to check with the methods used in the proof of Lemma [dj that on D, the 
operator 


if (A) - if(B) - a(A - B) 

b f 1 2 (A-B) 2 


i 


A 


A J 0 1 +A(1 -2 B)+t 


(A-B) 


1 + A(1 - 2 B) + t 


dt 


(31) 

d/i(A), 
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is well-defined, symmetric and due to the previous reasoning also positive [compare with 
Eq. (TT6|) ] . Again by Lemma [3], its associated quadratic form is q. The theorem on the 
Friedrichs extension tells us that q is closable and that its closure (q, Q(q)) is the quadratic 
form of a unique self-adjoint operator ( T , T>(T)) whose domain D(T) is contained in the form 
domain Q(q) of q, see 6, Theorem X.23]. Additionally, T is positive. Having the Friedrichs 
extension at hand, we can define the right hand side of Eq. (J7J) to be the trace of T. To 
that end, we restrict our attention to bases {ep} , f = i of h with ep e Q(q) for all (3 G N and 
define Tr(T) = ^^pLi q(ep, ep). Of course, this definition does not depend on the choice of 
the basis, ft yields the usual notion of trace when T is trace-class and gives Tr(T) = +oo 
otherwise. If B has an eigenvalue at a point of discontinuity of <p' with A — B ^ 0 on 
the corresponding eigenspace, Lemma [3] suggest to define the trace of the right hand side of 
Eq. dZD t° be +oo. This goes hand in hand with the definition of Lewin and Sabin mentioned 
in the beginning of the introduction. 


Having these prerequisites at hand, we come to the main part of our proof. If B has an 
eigenvalue at a point of discontinuity of ip' and (A — B) ^ 0 on the corresponding eigenspace 
then both sides of Eq. (J7|) equal +oo. For the right hand side this has been discussed in the 
previous paragraph while for the left hand side, this can be seen by choosing P\ such that 
the eigenspace of the just mentioned eigenvalue lies in its range. Hence, we can exclude this 
case in what follows. The key point of our proof is the more explicit formula Eq. (JHJ) for the 
trace of the operator ip(A) — <p(B) — ^p>(aA + (1 — a)B) | a=0 which we derive now. Using 
Eq. (ITT]) . the operator ip(A) — (p(B) can be written as 


<-p{A) — <p{B) = a(A — B) 

b f 1 f 2{A - B ) ln(l + A(1 - 2A)) - ln(l + A(1 - 2 B)) 


(32) 




A 


A 2 


d/i(A). 


In the next step, we write the difference of the two logarithms in Eq. (j32l) with the help 
of the formula ln(x) = J 0 °° (-A_ _ as an integral over resolvents. When we add the 

explicit representation for -^<p(aA + (1 — a)B) | Q= o that has been derived in Lemma [3] and 
apply the resolvent identity twice, we arrive at the formula 


(^(A) - (p(B) - -^p{olA + (1 - ot)B) | q=0 = 


( 33 ) 


2b 


;Q 


Q- 


’-1 Jo 


1 + A(1 - 2B) + t 1 + A(1 - 2A) +1 1 + A(1 - 2 B) + t 


-d t d/i(A), 
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where we have introduced the shortcut Q = {A — B). Taking the trace on both sides, we 
can commute it with the integrals because the integrand is a positive operator and obtain 
Eq. (151). Hence, we have proved Lemma HJ 

Now let {P n }^Li be an increasing sequence of finite-dimensional projections that converges 
to 1 in the strong operator topology. Because for matrices the two ways of writing the 
relative entropy are the same (see Remark HD we have the formula 

/ I roc 

J Tr [. R(B n )Q n R(A n )Q n R(B n )] d t d/i(A) (34) 

with A n = P n AP n and so on. We will first show that lim^oo Tr [R(B n )Q n R(A n )Q n R(B n )\ = 
Tr [R(B)QR(A)QR(B)] and then argue why we can interchange the limit with the two 
integrals. 

Let 77i > 1. In order to be able to restrict the trace on the right hand side of Eq. (134)) to a 
finite-dimensional subspace, we first investigate 

Tr [(1 - P m )R{B n )Q n R(A n )Q n R(B n ){ 1 - P m )] (35) 

< Tr [(1 - P m )R(B n )QlR(B n )( 1 - P m j\ * 

< Tr [(1 - P m )R(B n )P n Q 2 P n R(B n )( 1 - P m ) 1 --j——. 

Let us for the moment assume that Q is Hilbert-Schmidt which implies that it can be written 
as Q = Yl'aLi Qp\ / 4’p)(' l l’p\ with q | < c>o. The case when this does not hold true is taken 

care of at the end. Using the cyclicity of the trace, we write 

Tr [(1 - P m )R{B n )P n Q 2 P n R{B n ){ 1 - P m )] = Tr [QP n R(B n )( 1 - P m )R(B n )P n Q} (36) 

k 

= QP n R{B n ){ 1 - Pm)R(B n )P n Q ^ a ) 

a=l 

oo 

+ ^ (^a, QP n R(B n )( 1 - Pm)R{B n )P n Q ^ a ). 

a=k-\-1 

The term in the last line on the right hand side of Eq. (1361) can be bounded uniformly in n 


13 




as the next calculation shows, 


(-0a, QP n R(B n )( 1 - P m )R(B n )P n Qi /} a ) 

a=k -\-1 


oo 

E q 2 a ^ a ,P n R(B n )( 1 

a=fc+l 



(37) 

P m )R(B n )P n i > Q ) 


The right hand side of Eq. (137|) goes to zero as fc tends to infinity for all 
— 1 < A < 1 and t > 0 due to the assumptions on Q. On the other hand, 
SLr & a ,QP n R{B n ){l - P m )R{B n )P n Q ^ a ) -> Eti (Vu,Qi^(5)(l - P m )R{B)Q^ a ) for 
n —> oo because the sum is finite and the operator in the middle is convergent in the strong 
operator topology, see [s), Theorem VIII.20]. When we consider Eq. (j35j) again and take the 
limit n —» oo followed by the limit k —> oo, we arrive at 


lim Tr [(1 - P m )R(B n )Q 2 R(B n )(l - P m )] (38) 

n—>oo 

= Tr [(1 - P m )R(B)Q 2 R(B)(l - P m )\. 

Let us denote the left hand side of this equation by S(n, m) and the right hand side by 5(m). 
By construction, liiUm^oo S(m) = 0 holds. Using this result, we easily get the following two 
inequalities 

Tr [. R(B n )Q n R(A n )Q n R(B n )] < Tr [. P rn R(B n )Q n R(A n )Q n R(B n )P m ] + 5(n,m), (39) 

Tr [ R(B n )Q n R(A n )Q n R(B n )] > Tr [P m R(B n )Q n R(A n )Q n R(B n )P m \, 

where 8(n,m) = 5(n,m)( 1 — |A| + i) _1 . Taking first the limit n — » cxd and then the limit 
m —> oo in the above equations, we conclude that lim lWOO Tr [R(B n )Q n R(A n )Q n R(B n )] = 
Tr [R(B)QR(A)QR(B)] for all -1 < A < 1. 


The next step in the proof is to interchange the limit n —» cxd and the integrals. Let us start 
with the integral over t. Since we only need a bound for almost every A to apply dominated 
convergence we can assume that —1 < A < 1. Linder these conditions a dominating function 
is easily constructed because 

Tr[R(B n )Q n R(A n )Q n R(B n )] < ( * ) \\Q\\l (40) 
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Hence, we have shown that 

pi poo 

/ / Tr [R(B)QR(A)QR(B )] dtdn(X) 


( 41 ) 


'-1 J o 


'-1 


lim 

n—too 


Tr [R(B n )Q n R(A n )Q n R(B n )] dt ) d/x(A). 


To interchange the limit with the hrst integral, we have to argue more carefully and use the 
monotonicity of the relative entropy. With similar but somewhat easier arguments than the 
ones used to prove Lemma El we can show that 

— ln(l + A(1 — 2A n )) + ln(l + A(1 — 2 B n )) (42) 

+ ^ l n (l + A(1 — 2(aA n + (1 — aB n ))))\ a=0 ^ 

poo 

= / R(B n )Q n R(A n )Q n R(B n )dt. 

J o 

Now we take the trace on both sides of the above equation. On the right hand side, we 
interchange the trace with the integral over t and use the result from Eq. (j4TT) to arrive at 

pi poo 

(43) 


Tr \R(B)QR(A)QR(B)] d«d^(A) 


'-1 JO 


'-1 


, lim Tr 

A 2 V n—>-oo 


ln(l + A(1 — 2A n )) + ln(l + A(1 — 2B n )) 


+ ^ hi(l + A(1 — 2(aA n + (1 — aB n )))) | a=0 J d/i(A). 

Since x i —> (—ln(a;)) , = —A is operator monotone, the integrand on the right hand side of 
Eq. (j43|) is monotone in n by Theorem [0 On the other hand, from what we said above, we 
know that it converges pointwise for all — 1 < A < 1 as n tends to infinity. Therefore, the 
interchange of the limit n —>■ oo and the integral over A is justified by monotone convergence. 
This completes the proof for the case when Q = (A — B) is Hilbert-Schmidt. 


Now assume that {A — B) is not Hilbert-Schmidt. From [L Theorem 3], we conclude that 
there is a constant C > 0 such that 


lim n(A n ,B n )>C\\(A-B)\\ 2 2 = oo. 


On the other hand 


Tr [R(B)QR(A)QR(B)] > Tr [R(B)Q 2 R(B)] 


4 + t 


= oo, 


(44) 


(45) 
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where the equality on the right hand side is justified by the fact that 5(5) is bounded and 
invertible for all — 1 < A < 1. ffence, the right hand side of Eq. (J7|) equals +oo as well. This 
completes the proof of Theorem [3J 


III. PROOF OF THEOREM 0 


As in the proof of Theorem [3l we start with a Lemma in order not to interrupt the main 
argumentation. Throughout the whole section we assume that the b in Eq. (fill is strictly 
positive. This is reasonable because otherwise the relative entropy equals zero. 


Lemma 4. Assume that (A — B) and ip'(B)(A — B) are trace-class. Then 


oo „i , 

/ M 

/3=m -1 ' 


2 B 


1 — A(25 — 1) 


I dfi(X) < oo, 


where (A - B) = Y^f =1 qp\ 03) with 


< oo. 


(46) 


Proof. The integral representation of Eq. (HU . tells us that 

rl 2B-1 


(p'(B)(A — B) = a(A — B) + b 


d/i(A)(A-5). 


(47) 


/_! 1 — A(25 — 1) 

Because (A — B) is trace-class by assumption we know that the second term on the right 
hand side of the above equation is trace-class as well. And due to the polar decomposition, 
there exist two partial isometries U and V such that 


2B-1 


-d/i(A ){A-B) = U 


2 B 


Li 1- X(2B- 1) 


d/i(A) 


\A-B\V. (48) 


J_ x 1 - A(2 B - 1) 

Since the set of all trace-class operators is a two-sided ideal in the algebra of bounded 
operators C{h) we conclude that the term on the right hand side of Eq. (jj8j) without U 
and V is trace-class as well. We decompose the operator B in the way B = B 1(B < 
1/2 ) + B 1 (B > 1/2) to see that the absolute value of the integral on the right hand side of 


Eq. jSD is given by 
u 

00 > Tr 


/- 1 l-Apfl- l) d M A ) = 1-1 
25-1 


2B—1 


1—A(2B—1) 


d/i(A). Therefore, 


1 - A(2 B - 1) 


d/x(A) 


\A — B\ 

OO 

= E 


(49) 


M "0/3, 


/3=1 


'-1 


25-1 


1 - A(25 - 1) 


d/i(A). 


This is what we intended to show. 


□ 
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Having Lemma [4] at hand, the proof of Theorem [H that is the proof of the identity 
Tr [ip'(B)(A — B)\ = Tr (aA + (1 — a)B) | ], is in principle a straightforward com¬ 

putation that exploits the cyclicity of the trace. We start by inserting the integral represen¬ 
tation of tp' [Eq. (HOD ] into Tr | [<p'(B)(A — B)} to obtain 

'' 2B — 1 


Tr [ip'(B)(A - B)\ = a Tr(H - B) + b Tr 

OO «1 

,Ti(A- B) + bJ2 \ 

8=1 J ~ l 


_! 1 - A(2 B - 1) 
2 B 


= a 


98 *l>8> 


(A-B)dn(X) 
- 1 


(50) 


1 - A(2 B - 1) 


Vu d/i(A). 


Here, denotes the complete set of eigenfunctions of the self-adjoint operator (A — B ). 

We wish to interchange the sum over f3 and the integral over A on the right hand side of the 
above equation. Using the bound 


Q8 ^8i 


2B-1 


1 - A(2 B - 1) 


i>8 


< M V>/3, 


2B — 1 


1 — X(2B — 1) 


8 


(51) 


and Lemma HI this is justified by an application of Fubini’s theorem. On the other hand, 
the operator [A — B) is trace-class as long as — 1 < A < 1 because (A — B) is 


trace-class and 

»i 

Tr ' 


2B—1 


1—A(2B—1) 

2B — 1 


is bounded. We conclude that 


J_ ± 1 - A(2 B - 1) 

Using the identity 

1-2 B 


(A - B)dpi(X) 


= /> 


2 B 


1 - A(2 B - 1) 


l + A(l-2 B) A A 

Eq. (152D can be written as 

25-1 


1 +A(1 -2 B)+t 


(A-B) 


dt, 


d/i(A). (52) 


(53) 


Tr 




1 - A(2 B - 1) 


(A-B) dfi(X) 


(54) 


1 

2 


Tr 




2 

A 


2 

A 


l + A(l-2 B)+t 


dt}(A- B) 


d/i(A). 


< 


|A| +1 


\\A-B II, 


(55) 


With the bound 

E (^( 1 + a( i 1 om,J ( A - B)te 

which holds for all — 1 < A < 1, we argue like above with Fubini that the trace can be 
interchanged with the integral over t. Now we can use the cyclicity of the trace to arrive at 
f 1 2 B - 1 


Tr 


Li 1 - A(2 B - 1) 


Qdp,(X) 


(56) 


i-i 


A 


A 


Tr 


:Q: 


.1 + A(1 - 2B) +1 + A(1 - 2 B) + t 


dt ) d/i(A). 
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To shorten the writing, we have used the shortcut Q = {A — B). Except for the fact that 
the trace is inside the integral, this is what we wanted to obtain (compare with the result 
of Lemma [3D . 


Now we have to argue why we can take the trace out of the integral again which would 
complete the proof. By Q + and we denote the positive and the negative part of the 
operator Q = (A — B), respectively. First, we want to show that the above term with Q 
replaced by Q + or by Q-, that is 


'-i 


2 2 
A TrQ± -A 


Tr 


;Qz\- 


dfj d/i(A), (57) 


1 + A(1 - 2B) + t ^ 1 + A(1 - 2 B) + t_ 

is finite. To that end, we use the cyclicity of the trace to bring the two resolvents (1 + A(1 — 
2 B) + f) -1 together again, the bound from Eq. (15H [with Q = (A — B) replaced by Q± 
on the left hand side] and Lemma [4] another time. In other words, we go from Eq. (1561) to 
Eq. (l5Tf in backward order with Q = (A — B) replaced by Q±. This shows the finiteness of 
Eq. §ZD. 


Next, we go back to Eq. (1571) and split the integral over A into three parts, one from —1 to 
— 1/2, one from —1/2 to 1/2 and a last one from 1/2 to 1. The integral from —1/2 to 1/2 is 
easy to treat. We look at Eq. (j57l) again, adjust the boundaries of the integral over A to run 
from —1/2 to 1/2 and evaluate the trace in an arbitrary basis. Like in the proof of Lemma EU 
we show that there is no singularity at A = 0. Together with the standard estimates used 
in the proof of Theorem [3] this implies that the expression inside the integral over A can be 
bounded by a constant. Since /i is a probability measure this is enough to apply dominated 
convergence and interchange the sum coming from the trace and the integral over A. The 
fact that this works for any basis, shows that 

C "I f 1 + A(1 — 2B) + ~A ± 1 -I- A(1 — 2B) + < 58 > 

is trace-class and that for this term the trace and the integral can be interchanged. 


In the next step, we investigate the integral from 1/2 to 1, that is Eq. (J57]) with the adjusted 
integral boundaries. Since 



It Q±dn(\) < 4 IIQH, 


(59) 
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the first term inside the integral over A can be integrated separately. Additionally, the trace 
and the integral over A can be interchanged for this term as well. This implies that also 


Tr 


1/2 


:Q±- 


d/d/i(A) 


(60) 


A .In “ 1 + A(1 - 2B) + i"*n + A(1 - 2B) + f_ 

is hnite. Since the operator inside the trace is positive we can apply Fnbini to interchange the 
trace with the integral over t and afterwards with the integral over A. The same arguments 
work for the integral from —1 to —1/2. Putting all this together, we have shown that 


i-i 


2 2 

A^A 


Tr 


:Q- 


= Tr 


'-l 


2 2 

-Q-- 
a A 


1 + A(1 - 2B) + t 1 + A(1 - 2 B) + t 
° 1 „ 1 


dt\ d/i(A) 


(61) 




1 + A(1 - 2B) +1 + A(1 - 2 B) + t 


d£) d/i(A) 


which together with Eq. (J50j) and Eq. (1561) implies that 


Ti' — B)] 


(62) 


= Tr 


«Q ~ T I -T 


’-1 


2 Q 


Q- 


A A J 0 1 + A(1 — 2B) + t 1 + A(1 — 2B) + t 


d t > d/i(A) 


holds. In particular, the operator on the right hand side of Eq. (I62j) is trace-class. Since 
(p'(B)(A — B) is trace-class by assumption we know that B cannot have eigenvalues at 
points of discontinuity of ip 1 with (A — B) ^ 0 on the corresponding eigenspaces. From 
this we conclude with the help of Lemma [3] that ( aA + (1 — a)B ) | Q= o can be defined 
as a semibounded quadratic form on D. Also on D, it is the associated quadratic form of 
the operator under the trace on the right hand side of Eq. (l62lh see again Lemma 3. This 
operator is bounded and hence we can extend ( aA + (1 — a)B ) | Q=0 to a bounded and 

symmetric quadratic form on all of h whose associated self-adjoint operator is the operator 
under the trace on the right hand side of Eq. (l62lh Hence, we have shown that 


Tr [y'{B){A-B)\ = Tr 
This concludes the proof of Theorem [4] 


— v (aA + (a~a)B) | a=0 


(63) 
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